Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
6300 sentences
Production Status:
Existing-used
Use:
speech enhancement
- Paper title: Incorporating Symbolic Sequential Modeling for Speech Enhancement
- Paper track: 6.4 Speech enhancement: single-channel/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chien-Feng Liao | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
200 hours
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: Unsupervised Singing Voice Conversion
- Paper track: 7.11 Synthesis of singing voices/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Lior Wolf | Stanford DAMP | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC ShareAlike
Size:
85 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio
- Paper track: 10.8 Zero-resource speech recognition/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Emmanuel Azuh | Places Audio Caption Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
753 MByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: NIESR: Nuisance Invariant End-to-end Speech Recognition
- Paper track: 8.3 Robustness against noise or reverberation/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | I-Hung Hsu | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
5.5 GByte
Production Status:
Existing-used
Use:
Acquisition
- Paper title: Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification
- Paper track: 13.9 Privacy in Speech and Audio Interfaces/Poster Presentation
- Paper status: Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alexandru Nelus | Wall Street Journal (WSJ) Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
2 GByte
Production Status:
Existing-used
Use:
Voice Control
- Paper title: A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
- Paper track: 9.7 Computational resource constrained speech reco/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ye Bai | Speech commands v1 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
English Spanish
Availability:
Freely Available
License:
CreativeCommons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Size:
1.7 GByte
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Prosodic Phrase Alignment for Machine Dubbing
- Paper track: 12.19 Other topics in Spoken Language Processing: /Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alp Öktem | Heroes Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
500 MByte
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: A Saliency-based Attention LSTM Model for Cognitive Load Classification from Speech
- Paper track: 3.3 Automatic analysis of speaker states/Poster Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ascension Gallardo-Antolin | Cognitive Load with Speech and EGG (CSLE) database | /N |
Documentation:
The documentation is publicly available in: “Speech production under cognitive load: Effects and classification”, T. F. Yap, Dissertation, The University of New South Wales, 2012 and “The INTERSPEECH 2014 Computational Paralinguistics Challenge: Cognitive & Physical Load”, B. Schuller, S. Steidl, A. Batliner, J. Epps, F. Eyben, F. Ringeval, E. Marchi, Y. Zhang, Proceedings INTERSPEECH 2014, ISCA, Singapore, Singapore, 2014.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Specified in the ReMASC User Agreement
Size:
~50 GByte
Production Status:
Newly created-finished
Use:
Speech Processing System Anti-spoofing
- Paper title: ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems
- Paper track: 5.12 Other topics in Analysis of Speech and Audio /Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yuan Gong | ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems | /N |
Documentation:
Yes. The documentation is in English and publicly available.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Singapore Open Data License v1.0
Size:
534 GByte
Production Status:
Newly created-in progress
Use:
Machine Learning
- Paper title: Building the Singapore English National Speech Corpus
- Paper track: 12.6 Speech and multimodal resources/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kevin Khoo | National Speech Corpus (Singapore) | /N |
Documentation:
None




